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A NOVEL METHOD OF DIAGNOSING, 
MONITORING , AND STAGING COLON CANCER 

FIELD OF THE INVENTION 

This invention relates, in part, to newly developed 
assays for detecting, diagnosing, monitoring, staging, and 
prognosticating cancers, particularly colon cancer. 

BACKGROUND OF THE INVENTION 

Colon cancer is the second most frequently diagnosed 
malignancy in the United States. Cancer of the 

gastrointestinal tract, especially colon cancer, is a highly 
treatable and often a curable disease when localized to the 
bowel. However, currently colon cancer is the second most 
common cause of cancer death. Surgery is the primary 
treatment and results in cure in approximately 50% of 
patients. Recurrence following surgery is a major problem and 
often is the ultimate cause of death. The prognosis of colon 
cancer is clearly related to the degree of penetration of the 
tumor through the bowel wall and the presence or absence of 
nodal involvement. These two characteristics form the basis 
for all staging systems developed for this disease. Bowel 
obstruction and bowel perforation are indicators of poor 
prognosis. Elevated pretreatment serum levels of 

carcinoembryonic antigen (CEA) and carbohydrate antigen 19-9 
(CA 19-9) also have negative prognostic significance. 

Because of the frequency of the disease (approximately 
160,000 new cases of colon cancer per year), the 
identification of high-risk groups, the demonstrated slow 
growth of primary lesions, the better survival of early-stage 
lesions, and the relative simplicity and accuracy of screening 
tests, screening for colon cancer should be a part of routine 
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care for all adults starting at age 50, especially those with 
first-degree relatives with colorectal cancer. 

Procedures used for detecting, diagnosing, monitoring, 
staging, and prognosticating colon cancer are of critical 
importance to the outcome of the patient. For example, 
patients diagnosed with early colon cancer generally have a 
much greater five-year survival rate as compared to the 
survival rate for patients diagnosed with distant metastasized 
colon cancer. Treatment decisions are usually made in 
reference to the older Dukes or the Modified Astler-Coller 
(MAC) classification schema for staging. However, new 
diagnostic methods which are more sensitive and specific for 
detecting early colon cancer are clearly needed. 

Further, colon cancer patients must be closely monitored 
following initial therapy and during adjuvant therapy to 
determine response to therapy and to detect persistent or 
recurrent disease of metastasis. Thus, there is clearly a 
need for a colon cancer marker which is more sensitive and 
specific in detecting colon cancer recurrence. 

Another important step in managing colon cancer is to 
determine the stage of the patient's disease. Stage 
determination has potential prognostic value and provides 
criteria .for designing optimal therapy. Currently, 
pathological staging of colon cancer is preferable over 
clinical staging as pathological staging provides a more 
accurate prognosis. However, clinical staging would be 
preferred were the method of clinical staging at least as 
accurate as pathological staging because it does not depend 
on an invasive procedure to obtain tissue for pathological 
evaluation. Staging of colon cancer would be improved by 
detecting new markers in cells, tissues, or bodily fluids 
which could differentiate between different stages of 
invasion. 

In the present invention, methods are provided for 
detecting, diagnosing, monitoring, staging, and 
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prognosticating colon cancers, particularly colon, stomach, 
L small intestine cancer, via nine ,9) Colon Specific Genes 
(CSGS) . The nine CSGs refer, among other things, to native 
proteins expressed by the genes comprising the polynucleotide 
sequences of any of SEQ ID N0:1, 2, 3, 4, 5, 6, 7, or 9 
in the alternative, what is meant by the nine CSGs as used 
herein, means the native mRNAs encoded by the genes comprising 
any of the polynucleotide sequences of SEQ ID NO: 1, 3. 4. 
5 6 7 8 or 9 or levels of the genes comprising any of the 
polynucleotide sequences of SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 8 
or 9 . 

Other objects, features, advantages and aspects of the 

present invention will become apparent to those of .kill in 

the art from the following description. It should be 

understood, however, that the following description and the 

specific examples, while indicating preferred embodiments of 

the invention, are given by way of illustration only. Various 

changes and modifications within the spirit and scope of the 

disclosed invention will become readily apparent to those 

skilled in the art from reading the following description and 

„i. narte of the present disclosure, 

from reading the other parts oi i« v 

SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the 
present invention to provide a method for diagnosing the 
presence of colon cancer in a patient which comprises 
measuring levels of CSG in a sample of cells, tissue or bodily 
fluid from the patient and comparing the measured levels of 
CSG with levels of CSG in preferably the same cells, tissue, 
or bodily fluid type of a control, wherein an increase in the 

, n,. natient versus levels of CSG in the 
measured CSG levels in the patient 

control is associated with colon cancer. 

Another object of the present invention is to provide 
a method of diagnosing metastatic colon cancer in a patient 
which comprises measuring CSG levels in a sample of cells. 
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tissue, or bodily fluid from the patient and comparing the 
measured CSG levels with levels of CSG in preferably the same 
cells, tissue, cr bodily fluid type of a control, wherein an 
increase in measured CSG levels in the patient versus levels 
of CSG in the control is associated with a cancer which has 
metastasized. 

Another object of the present invention is to provide 
a method of staging colon cancer in a patient which comprises 
identifying a patient having colon cancer, measuring levels 
of CSG in a sample of cells, tissues, or bodily fluid obtained 
from the patient, and comparing the measured CSG levels with 
levels of CSG in preferably the same cells, tissue or bodily 
fluid type of a control. An increase in measured CSG levels 
in the patient versus CSG levels in the control can be 
associated with a cancer which is progressing while a decrease 
or equivalent level of CSG measured in the patient versus the 
control can be associated with a cancer which is regressing 

or in remission. 

Another object of the present invention is to provide 
a method of monitoring colon cancer in a patient for the onset 
of metastasis. The method comprises identifying a patient 
having colon cancer that is not known to have metastasized, 
periodically measuring levels of CSG in a sample of cells, 
tissues, or bodily fluid obtained from the patient, and 
comparing the measured CSG levels with levels of CSG in 
preferably the same cells, tissue, or bodily fluid type of a 
control, wherein an increase in measured CSG levels versus 
control CSG levels is associated with a cancer which has 
metastasized. 

Yet another object of the present invention is to 
provide a method of monitoring the change in stage of colon 
cancer in a patient which comprises identifying a patient 
having colon cancer, periodically measuring levels of CSG in 
a sample of cells, tissue, or bodily fluid obtained from the 
patient, and comparing the measured CSG levels with levels of 



. WO 99/60161 



PCT/US99/10498 



- 5 



CSG in preferably the same cells, tissues, or bodily fluid 
tyP e of a control wherein an increase in measured CSG levels 
versus the control CSG levels is associated with a cancer 

. =nHa decrease in the measured CSG levels 
which is progressing and a decrease 

i rsr levels is associated with a cancer 
versus the control CSG levels 

which is regressing or in remission. 

Other obnects, features, advantages and aspects of the 
present invention will become apparent to those of skill in 
the art from the following description. It should be 
understood, however, that the following description and the 
specific examples, while indicating preferred embodiments of 
t he invention, are given by way of illustration only. Various 
changes and modifications within the spirit and scope of the 
disclosed inversion will become readily apparent to those 
skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

DESCRIPTION OP THE INVENTION 

„»i„n relates to diagnostic assays and 
The present invention relates iu ^ i 

methods, both quantitative and gualitative for 

diagnosing, monitoring, staging, and prognosticate cancer 

. nf rsG wit h those of CSG in a normal human 
by comparing levels of CSG wicn 

control. What is meant by "levels of CSG" as used herein, 
means levels of the native protein expressed by the genes 
comprising the polynucleotide sequence of any of SEQ ID NO. 
1 2 , 4 5 6, 7, 8 or 9. m the alternative, what is 
meant by' "levels of CSG" as used herein, means levels of the 
native mRNA encoded by any of the genes comprising any of the 
polynucleotide sequences of SEQ ID NO: 1, 2, 3. 4, 5, 6 , 
■- 8 or 6 or levels of the gene comprising any of the 
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 
L 9. Such levels are preferably measured in at least one of, 
cells, tissues and/or bodily floid.. including determination 
of normal and abnormal levels. Thus, for 
diagnostic assay in accordance with the invention for 
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diagnosing over-expression of any one of the CSG proteins 
compared to normal control bodily fluids, cells, or tissue 
samples may be used to diagnose the presence of cancers, 
including colon cancer. Any of the nine CSGs may be measured 
alone in the methods of the invention, or all together or any 
combination of the nine. 

By "control" it is meant a human patient without cancer 
and/or non cancerous samples from the patient, also referred 
to herein as a normal human control; in the methods for 
diagnosing or monitoring for metastasis, control may also 
include samples from a human patient that is determined by 
reliable methods to have colon cancer which has not 
metastasized. 

All the methods of the present invention may optionally 
include measuring the levels of other cancer markers as well 
as CSG. Other cancer markers, in addition to CSG, useful in 
the present invention will depend on the cancer being tested 
and are known to those of skill in the art. 
Diagnostic Assays 

The present invention provides methods for diagnosing 
the presence of colon cancer by analyzing for changes in 
levels of CSG in cells, tissues or bodily fluids compared with 
levels of CSG in cells, tissues or bodily fluids of preferably 
the same type from a normal human control, wherein an increase 
in levels of CSG in the patient versus the normal human 
control is associated with the presence of colon cancer. 
Without limiting the instant invention, typically, for a 
quantitative diagnostic assay a positive result indicating the 
patient being tested has cancer is one in which cells, 
tissues, or bodily fluid levels of the cancer marker, such as 
CSG, are at least two times higher, and most preferably are 
at least five times higher, than in preferably the same cells, 
tissues, or bodily fluid of a normal human control. 

The present invention also provides a method of 
diagnosing metastatic colon cancer in a patient having colon 
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cancer which has not yet metastasized for the onset o£ 
metastasis. In the method of the present invention, a human 
cancer patient suspected of having colon cancer which may have 
metastasized (but which was not previously known to have 
metastasized, is identified. This is accomplished by a 
variety of means known to those of skill in the art. For 
example, in the case of colon cancer, patients are typica y 
diagnosed with colon cancer following traditional detection 

methods- - 
in the present invention, determining the presence of 

CSG level in cells, tissues, or bodily fluid, is particularly 

useful for discriminating between colon cancer which has not 

metastasized and colon cancer which has metastasized. 

Existing techniques have difficulty discriminating between 

colon cancer which has metastasized and colon cancer whxch has 

. »r,rf orooer treatment selection is often 

not metastasized and propei. 

dependent upon such knowledge. 

In the present invention, the cancer marker levels 
measured in such cells, tissues, or bodily fluid is CSG. and 
are compared with levels of CSG in preferably the same cells 
tissue, or bodily fluid type of a normal human control. That 
is. if the cancer marker being observed is just CSG in serum, 
this level is preferably compared with the level of CSG in 
serum of a normal human patient. An increase in the CSG in 
the patient versus the normal human control is associated with 
colon cancer which has metastasized. 

Without limiting the instant invention, typically, for 

. ....^.fic assav a positive result indicating 

a quantitative diagnostic assay w 

the cancer in the patient being tested or monitored has 
metastasized is one in which ell., tissues, or bodily fluid 
ievels of the cancer marker, such as CSG, are at least two 
times higher, and most preferable are at least five times 
higher, than in preferably the same cells, tissues, or bodily 
fluid of a normal patient. 
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Normal human control as used herein includes a human 
patient without cancer and/or non cancerous samples from the 
patient; in the methods for diagnosing or monitoring for 
metastasis, normal human control may also include samples from 
a human patient chat is determined by reliable methods to have 
colon cancer which has not metastasized. 
Staging 

The invention also provides a method of staging colon 
cancer in a human patient. 

The method comprises identifying a human patient having 
such cancer; analyzing a sample of cells, tissues, or bodily 
fluid from such patient for CSG. Then, the method compares 
CSG levels in such cells, tissues, or bodily fluid with levels 
of CSG in preferably the same cells, tissues, or bodily fluid 
type of a normal human control sample, wherein an increase in 
CSG levels in the patient versus the normal human control is 
associated with a cancer which is progressing and a decrease 
in the levels of CSG is associated with a cancer which is 
regressing or ir. remission. 
Monitoring 

Further provided is a method of monitoring colon cancer 
in a human having such cancer for the onset of metastasis. 
The' method' comprises identifying a human patient having such 
cancer that is not known to have metastasized; periodically 
analyzing a sample of cells, tissues, or bodily fluid from 
such patient for CSG; comparing the CSG levels in such cells, 
tissue, or bodily fluid with levels of CSG in preferably the 
same cells, tissues, or bodily fluid type of a normal human 
control sample, wherein an increase in CSG levels in the 
patient versus the normal human control is associated with a 
cancer which has metastasized. 

Further provided by this inventions is a method of 
monitoring the change in stage of colon cancer in a human 
having such cancer. The method comprises identifying a human 
patient having such cancer; periodically analyzing a sample 
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of cells, tissues, or bodily fluid from such patient for CSG; 
comparing the CSG levels in such cells, tissue, or bodily 
fluid with levels of CSG in preferably the same cells, 
tissues, or bodily fluid type of a normal human control 
sample, wherein an increase in CSG levels in the patient 
versus the normal human control is associated with a cancer 
which is progressing in stage and a decrease in the levels of 
CSG is associated with a cancer which is regressing in stage 

or in remission. 

Monitorinc such patient for onset of metastasis is 
periodic and preferably done on a quarterly basis. However, 
this may be more or less frequent depending on the cancer, the 
particular patient, and the stage of the cancer. 

Assay Techniques 

Assay techniques that can be used to determine levels 
of gene expression, such as CSG of the present invention, in 
a sample derived from a host are well-known to those of skill 
in the art. Such assay methods include radioimmunoassays, 
reverse transcriptase PCR (RT-PCR) assays, 
immunohistochemistry assays, in situ hybridization assays, 
competitive-binding assays, Western Blot analyses and ELISA 
assays. Among these, ELISAs are frequently preferred to 
diagnose 'a gene's expressed protein in biological fluids. 
An ELISA assay initially comprises preparing an antibody, if 
not readily available from a commercial source, specific to 
CSG, preferably a monoclonal antibody. In addition a reporter 
antibody generally is prepared which binds specifically to 
CSG. The reporter antibody is attached to a detectable 
reagent such as radioactive, fluorescent or enzymatic reagent, 
for example horseradish peroxidase enzyme or alkaline 
phosphatase. 

To carry out the ELISA, antibody specific to CSG is 
incubated on a solid support, e.g., a polystyrene dish, that 
binds the antibody. Any free protein binding sites on the 
dish are then covered by incubating with a non-specific 
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protein such as bovine serum albumin. Next, the sample to be 
analyzed is incubated in the dish, during which time CSG binds 
to the specific antibody attached to the polystyrene dish. 
Unbound sample is washed out with buffer. A reporter antibody 
specifically directed to CSG and linked to horseradish 
peroxidase is placed in the dish resulting in binding of the 
reporter antibody to any monoclonal antibody bound to CSG. 
Unattached reporter antibody is then washed out. Reagents for 
peroxidase activity, including a colorimetric substrate are 
then added to the dish. Immobilized peroxidase, linked to CSG 
antibodies, produces a colored reaction product. The amount 
of color developed in a given time period is proportional to 
the amount of CSG protein present in the sample. Quantitative 
results typically are obtained by reference to a standard 
curve . 

A competition assay may be employed wherein antibodies 
specific to CSG attached to a solid support and labeled CSG 
and a sample derived from the host are passed over the solid 
support and the amount of label detected attached to the solid 
support can be correlated to a quantity of CSG in the sample. 
Nucleic acid methods may be used to detect CSG mRNA as a 
marker for colon cancer. Polymerase chain reaction (PCR) and 
other nucleic acid methods, such as ligase chain reaction 
(LCR) and nucleic acid sequence based amplification (NASABA) , 
can be used to detect malignant cells for diagnosis and 
monitoring of various malignancies. For example, reverse- 
transcriotase PCR (RT-PCR) is a powerful technique which can 
be used to detect the presence of a specific mRNA population 
in a complex mixture of thousands of other mRNA species. In 
RT-PCR, an mRNA species is first reverse transcribed to 
complementary DNA (cDNA) with use of the enzyme reverse 
transcriptase; the cDNA is then amplified as in a standard PCR 
reaction. RT-PCR can thus reveal by amplification the 
presence of a single species of mRNA. Accordingly, if the 
mRNA is highly specific for the cell that produces it, RT-PCR 
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can be used to identify the presence of a specific type of 
cell . 

Hybridization to clones or oligonucleotides arrayed on 
a solid support (i.e., gridding) can be used to both detect 
the expression of and quantitate the level of expression of 
that gene. In this approach, a cDNA encoding the CSG gene is 
fixed to a substrate. The substrate may be of any suitable 
type including but not limited to glass, nitrocellulose, nylon 
or plastic. At least a portion of the DNA encoding the CSG 
gene is attached to the substrate and then incubated with the 
analyte, which may be RNA or a complementary DNA (cDNA) copy 
of the RNA, isolated from the tissue of interest. 
Hybridization between the substrate bound DNA and the analyte 
can be detected and quantitated by several means including but 
not limited to radioactive labeling or fluorescence labeling 
of the analyte or a secondary molecule designed to detect the 
hybrid. Quantitation of the level of gene expression can be 
done by comparison of the intensity of the signal from the 
analyte compared with that determined from known standards. 
The standards can be obtained by in vitro transcription of the 
target gene, quantitating the yield, and then using that 
material to generate a standard curve. 

The above tests can be carried out on samples, derived from a 
variety of patients' cells, bodily fluids and/or tissue 
extracts (homogenates or solubilized tissue) such as from 
tissue biopsy and autopsy material. Bodily fluids useful in 
the present invention include blood, urine, saliva, or any 
other bodily secretion or derivative thereof. Blood can 
include whole blood, plasma, serum, or any derivative of 
blood. 

EXAMPLES 

The present invention is further described by the 
following examples. These examples are provided solely to 
illustrate the invention by reference to specific embodiments. 
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These exemplifications, while illustrating certain specific 
aspects of the invention, do not portray the limitations or 
circumscribe the scope of the disclosed invention. 
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Exaxnpl 1: CSGs 

searches were carried cut and CSGs identified using the 

„ ayif n f the LIFESEQ® database 
following Search Tools as part of the 

available from Incyte Pharmaceuticals, Palo Alto, CA: 

1 Library Comparison (compares one library to one 
other library) allows the identification of clones expressed 
in tumor and absent or expressed at a lower level m normal 

tissue. . , . 

2 Subsetting is similar to library comparison but 

al lows the identification of clones expressed in a pool of 
libraries and absent or expressed at a lower level in a second 

pool of libraries. 

3 Transcript Imaging lists all of the clones in a 
single library or a pool of libraries based on abundance, 
jndividual clones can then be examined using Electronic 

fVlo H - sue sources of their component 
Northerns to determine the tissue source 

ESTS 

' 4 Protein Function: Incyte has identified subsets of 
ESTs with a potential protein function based on homologies to 
ta own proteins. Some examples in this database include 

. „ We identified some leads 

Transcription Factors and Proteases, we 

^.f=h=.«e for clones whose component E&is 
by searching in this database lor 

showed disease specificity. 

Electronic subtractions, transcript imaging and protein 
function searches were used to identify clones whose 
component ESTs were exclusively or more frequently found in 
libraries from specific tumors. Individual candidate clones 
were examined in detail by checking where each EST originated. 
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Table 1: CSGs 



SEQ ID 


Clone fr 


Gene ID # 












1 




242807 


Transcript Imaging 


o 
Z 


1 785234 

X c* O J ^ ~ 


239588 


Subsetting 


3 




29634 


Transcript Imaging 




fil 6957 

0 X O <J / 


233421 


Subsetting 


5 


775133 


245080 


Subsetting 


6 


1335450 


245811 


Subsetting 


7 


2348122 


233711 


Transcript Imaging 


8 


3228674 


230273 


Subsetting 


9 


1632174 


229022 


Transcript Imaging 



The following example was carried out using standard 
techniques, which are well known and routine to those of skill 
in the art, except where otherwise described in detail. 
Routine molecular biology techniques of the following example 
can be carried out as described in standard laboratory 
manuals, such as Sambrook et al., MOLECULAR CLONING : A 
LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. (1989). 

Example 2: Relative Quantitation of CSG Gene Expression 

Real-Time quantitative PCR with fluorescent Taqman 
probes is a quantitation detection system utilizing the 5'- 
3' nuclease activity of Taq DNA polymerase. The method uses 
an internal fluorescent oligonucleotide probe (Taqman) labeled 
with a 5' reporter dye and a downstream, 3' quencher dye. 
During PCR, the 5' -3' nuclease activity of Taq DNA polymerase 
releases the reporter, whose fluorescence can then be detected 
by the laser detector of the Model 7700 Sequence Detection 
System (PE Applied Biosystems, Foster City, CA, USA) . 
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Amplification of an endogenous control is used to 
standardize the amount of sample RNA added to the reaction and 
normalize for Reverse Transcriptase (RT) efficiency. Either 
cyclophilin, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) 
or 18S ribosomal RNA (rRNA) is used as this endogenous 
control. To calculate relative quantitation between all the 
samples studied, the target RNA levels for one sample are used 
as the basis for comparative results (calibrator). 
Quantitation relative to the "calibrator" can be obtained 
using the standard curve method or the comparative method 
(User Bulletin #2: ABI PRISM 7700 Sequence Detection System). 

To evaluate the tissue distribution, and the level of 
CSGs in normal and tumor tissue, total RNA was extracted from 
normal tissues, tumor tissues, and from tumors and the 
corresponding matched normal tissues. Subsequently, first 
strand cDNA was prepared with reverse transcriptase and the 
polymerase chain reaction was done using primers and Taqman 
probe specific to the CSG. The results were analyzed using 
the ABI PRISM 7700 Sequence Detector. The absolute numbers 
are relative levels of expression of the CSG compared to the 
calibrator. 
Comparative Examples 

Similar mRNA expression analysis for genes coding for 
the diagnostic markers PSA (Prostate Specific Antigen) and 
PLA2 (Phospholipase A2) was performed for comparison. PSA xs 
currently the only cancer screening marker available in 
clinical laboratories. When the panel of normal pooled 
tissues was analyzed, PSA was expressed at very high levels 
in prostate, with a very low expression in breast and testis. 
After analysis of more than 55 matching samples from 14 
different tissues, the data corroborated the tissue 
specificity seen with normal tissue samples. PSA expression 
was compared in cancer and normal adjacent tissue for 12 
matching samples of prostate tissue. The relative levels of 
PSA were higher in 10 cancer samples (83%). Clinical data 
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recently obtained support the utilization of PLA2 as a staging 
marker for late stages of prostate cancer. mRNA expression 
data described herein showed overexpression of the mRNA in 8 
out of the 12 prostate matching samples analyzed (66%). PLA2 
had high levels of mRNA expression in small intestine, 
prostate, liver, and pancreas. 



Measurement of SEQ ID NO:3; Clone ID 1341701; Gene ID 29634 
(Clnl06) 

Absolute numbers are depicted in Table 2 as relative 
levels of expression of Clnl06 (SEQ ID NO: 3) in 12 normal 
different tissues. All the values are compared to normal 
testis (calibrator) . These RNA samples are commercially 
available pools, originated by pooling samples of a particular 
tissue from different individuals. 

Table 2: Relative levels of Clnl06 Expression in Pooled 
Samples 



Tissue 


NORMAL 


Colon-Ascending 


110 


Endometrium 


0 


Kidney 


0 


Liver 


0 


Ovary 


0 


Pancreas 


0 


Prostate 


16 


Small Intestine 


0 


Spleen 


0 


Stomach 


0 


Testis 


1 


Uterus 


0 



The relative levels of expression in Table 2 show for 
the CSG Clnl06 (SEQ ID NO: 3) , mRNA expression is more than 6 
fold higher in the pool of normal ascending colon (110) 
compared with prostate (16). Testis, the calibrator, with a 
relative expression level of 1, is the only other tissue 
expressing the mRNA for Clnl06 (SEQ ID NO: 3). These results 
demonstrate that mRNA expression of this CSG is highly 
specific for colon. 
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The absolute numbers in Table 2 were obtained analyzing 
pools of samples of a particular tissue from different 
individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 

individual in Table 3. 

The absolute numbers In Table 3 are relative levels of 
expression of Clnl06 (SEQ ID NO: 3) in 57 pairs of matching 
samples. All the values are compared to normal testis 
(calibrator) . A matching pair is formed by mRHA from the 
cancer sample for a particular tissue and mBNA from the normal 
adjacent sample for that same tissue from the same individual. 
Tabl. 3: Relative levels of ClnlOS Expree.ion in Indxvxdual 
Samples 
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Pan 71XL 


Pancreas 1 


4.08 


0.1 


Pan 82XP 
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Prostate 1 


0.3 
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civ J " ^ 
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0 

1.26 


0.2 
0 



0= Negative 

When matching samples were analyzed, the higher levels 
of expression were in the colon, showing a high degree of 
tissue specificity for this tissue. These results confirm the 
tissue specificity results obtained with the panel of normal 
pooled samples (Table 2) . Furthermore, the level of mRNA 
expression in cancer samples, and the isogenic normal ad D acent 
tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 3 shows 
overexpression of Clnl06 (SEQ ID NO: 3) in 15 colon cancer 
tissues compared with their respective normal adjacent (colon 
samples #1, 2, 3, 4, 5, 6, 8, 9, 10, 12, 13, 16, 20, 21, and 
22) There is overexpression in the cancer tissue for 65% of 
the colon matching samples tested (total of 23 colon matching 
samples) . The matching sample Pan 71XL is a secondary cancer 
in pancreas, the primary cancer in that individual was a 

duodenal cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in 65% of the colon matching samples 
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tested are demonstrative of CSG Clnl06 (SEQ ID NO: 3) being a 
diagnostic marker for colon cancer. 

Measurement of SEQ ID NO: 4; Clone ID 816257; Gene ID 406452 
(Clnl07) 

Absolute numbers as depicted in Table 4 are relative 
levels of expression of CSG Clnl07 (SEQ ID NO: 4) in 12 normal 
different tissues. All the values are compared to normal 
small intestine (calibrator) . These RNA samples are 
commercially available pools, originated by pooling samples 
of a particular tissue from different individuals. 
Table 4: Relative levels of Clnl07 Expression in Pooled 



Samples 



Tissue 


NORMAL 


Colon-Ascending 


3.2 


Endometrium 


0 


Kidney 


0.2 


Liver 


0 


Ovary 


0 


Pancreas 


0 


Prostate 


0.1 


Small Intestine 


1 


Spleen 


0 


Stomach 


0.3 


Testis 


0 


Uterus 


0 



The relative levels of expression in Table 4 show that 
mRNA expression of the CSG Clnl07 (SEQ ID NO: 4) is more than 
10 fold higher in the pool of normal ascending colon (3.2), 
five fold higher in small intestine (1), and 1.5 fold higher 
in stomach (0.3), compared with the next higher expressor (0.2 
for kidney) . Seven of the pooled tissues samples analyzed 
were negative and prostate showed a relative expression of 0.1 
for Clnl07 (SEQ ID N0:4). These results demonstrate that 
Clnl07 mRNA expression is highly specific for colon, small 
intestine, and in a lower degree for stomach. 

The absolute numbers in Table 4 were obtained analyzing 
pools of samples of a particular tissue from different 
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individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 

individual in Table 5. 

The absolute numbers in Table 5 are relative levels of 
expression of Clnl07 (SEQ ID NO: 4) in 57 pairs of matching 
samples. All the values are compared to normal small 
intestine (calibrator) . A matching pair is formed by mRNA 
from the cancer sample for a particular tissue and mRNA from 
the normal adjacent sample for that same tissue from the same 
individual. 

Table 5: Relative levels of Clnl07 Expression in Individual 
Samples 
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0= Negative 

When matching samples were analyzed, the higher levels 
of expression were in colon, stomach, and small intestine, 
showing a high degree of tissue specificity for colon tissues. 
These results confirm the tissue specificity results obtained 
with normal pooled samples (Table 4). Furthermore, the level 
of mRNA expression in cancer samples and the isogenic normal 
adjacent tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 5 shows 
overexpression of Clnl07 (SEQ ID NO: 4) in 11 colon cancer 
tissues compared with their respective normal adjacent (colon 
samples #5, 7, 10, 11, 12, 13, 15, 17, 18, 20, and 21). There 
is overexpression in the cancer tissue for 48% of the colon 
matching samples tested (total of 23 colon matching samples) . 
The matching sample Pan 71XL is a secondary cancer in 
pancreas, the primary cancer in that individual was a duodenal 
cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in almost half of the colon, stomach, 
and small intestine matching samples tested are demonstrative 



WO 99/60161 PCT/US99/10498 . 

- 24 - 

of CSG Clnl07 (SEQ ID NO: 4) being a diagnostic marker for 
colon cancer. 

Measurement of SEQ ID NO: 5; Clone ID 775133; Gene ID 24508 
(Clnl08) 

The absolute numbers shown in Table 6 are relative 
levels of expression of CSG Clnl08 (SEQ ID NO: 5) in 12 normal 
different tissues. All the values are compared to normal 
small intestine (calibrator) . These RNA samples are 
commercially available pools, originated by pooling samples 
of a particular tissue from different individuals. 
Table 6: Relative levels of Clnl08 Expression in Pooled 



Samples 



Tissue 


NORMAL 


Colon-Ascending 


2846.5 


Endometrium 


1 


Kidney 


5.5 


Liver 


18.7 


Ovary 


3.4 


Pancreas 


198.1 


Prostate 


1024 


Small Intestine 


810. 8 


Spleen 


32.2 


Stomach 


9981.2 


Testis 


0 


Uterus 


294.1 



The relative levels of expression in Table 6 show that mRNA 
expression of CSG Clnl08 (SEQ ID NO: 5) is more than 10 fold 
higher in the pool of normal ascending colon (2846.5) and 
almost ten fold higher in stomach (9981.2), compared to the 
expression level in any other tissue analyzed. These results 
demonstrate that mRNA expression of this CSG is also highly 
specific for colon and stomach. 

The absolute numbers in Table 6 were obtained analyzing 
pools of samples of a particular tissue from different 
individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 
individual in Table 7 . 
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The absolute numbers depicted in Table 7 are relative 
levels of expression of Clnl08 (SEQ ID NO: 5) in 57 pairs of 
matching samples. All the values are compared to normal small 
intestine (calibrator) . A matching pair is formed by mRNA 
from the cancer sample for a particular tissue and mRNA from 
the normal adjacent sample for that same tissue from the same 
individual . 

Table 7 : Relative levels of ClnlOS Expression in Individual 

Samples 
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0= Negative 

When matching samples were analyzed, the higher levels 
of expression were in colon and stomach, showing a high degree 
of tissue specificity for these two tissue*. These results 
confirm the tissue specificity results obtained with normal 
pooled samples (Table 6). Furthermore, the level of mRNA 
expression in cancer samples and the isogenic normal adjacent 
tissue from the same individual were compared. This 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher leveis of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 7 shows 
overexpression of CSG ClnlOS (SEQ ID NO: 5) in 13 colon cancer 
tissues compared with their respective normal- adjacent (colon 

, . .1 1 6 7 8, 9, 10, 11, 12, 13, 18, 21, and 22). 

samples #1/ 5, /, AV ' 

There is overexpression in the cancer tissue for 56% of the 
colon matching samples tested (total of 23 colon matching 
samples). The matching sample Pan 71XL is a secondary cancer 
in pancreas, the primary cancer in that individual was a 

duodenal cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in more than half of the colon, 
stomach, and small intestine matching samples tested are 

~* .hi. rsG Clnl08 (SEQ ID NO: 5), also being 
demonstrative of this Cbfc, uiru.wo v 

a diagnostic marker for colon cancer. 
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Measurement of SEQ ID NO: 7; Clone ID 2348122; Gene ID 23371 
(Clnl09) 

The absolute numbers depicted in Table 8 are relative 
levels of expression of CSG Clnl09 (SEQ ID NO: 7) in 12 normal 
different tissues. All the values are compared to normal 
ovary (calibrator) . These RNA samples are commercially 
available pools, originated by pooling samples of a particular 
tissue from different individuals. 

Table 8: Relative levels of Clnl09 Expression in Pooled 
Samples 



Tissue 


NORMAL 


Colon-Ascending 


28.8 


Endometrium 


0.45 


Kidney 


0.41 


Liver 


0.72 


Ovary 


0.07 


Pancreas 


82.8 


Prostate 


124.3 


Small Intestine 


626.4 


Spleen 


1.2 


Stomach 


12.05 


Testis 


1.51 


Uterus 


52.99 



The relative levels of expression in Table 8 show that mRNA 
expression of CSG Clnl09 (SEQ ID NO: 7), is more than 5 fold 
higher in the pool of normal small intestine (626.4) compared 
to the expression level in any other tissue analyzed. These 
results demonstrate that Clnl09 (SEQ ID NO: 7) mRNA expression 
is highly specific for small intestine 

The absolute numbers in Table 8 were obtained analyzing 
pools of samples of a particular tissue from different 
individuals. They can not be compared to the absolute numbers 
originated from RNA obtained from tissue samples of a single 
individual in Table 9. 

The absolute numbers depicted in Table 9 are relative 
levels of expression of Clnl09 (SEQ ID NO: 7) in 53 pairs of 
matching samples. All the values are compared to normal ovary 
(calibrator) . A matching pair is formed by mRNA from the 
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cancer sample for a particular tissue and mRNA from the normal 
adjacent sample for that same tissue from the same individual. 

Relative levels of Clnl09 Expression in Individual 



Table 9: 
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0= Negative 
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When matching samples were analyzed, the higher levels 
of expression were in small intestine, colon and stomach, 
showing a high degree of tissue specificity for these three 
colon tissues. These results confirm the tissue specificity 
results obtained with normal pooled samples for small 
intestine (Table 8). Furthermore, the level of mRNA 
expression in cancer samples and the isogenic normal adjacent 
tissue from the same individual were compared. Thxs 
comparison provides an indication of specificity for the 
cancer stage (e.g. higher levels of mRNA expression in the 
cancer sample compared to the normal adjacent) . Table 9 shows 
overexpression of CSG, Clnl09 (SEQ ID NO:7) in 15 colon cancer 
tissues compared with their respective normal adjacent (colon 

, „, 7 3 6 7 8 , 9, 12, 13, 15, 16, 17, 19, 20, and 
samples #1/ 2, J, b, / ' 

21) There is overexpression in the cancer tissue for 68% of 
the colon matching samples tested (total of 22 colon matching 
samples) . The matching sample Pan 71XL is a secondary cancer 
in pancreas, the primary cancer in that individual was a 

duodenal cancer. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in more than half of the colon, 
stomach, and small intestine matching samples tested are 
demonstrative of CSG Clnl09 (SEQ ID NO: 7) being a diagnostic 
marker for colon cancer. The amino acid sequence encoded by 
the open reading frame of Clnl09 is depicted in SEQ ID NO:10. 
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What is claimed is: 

1. A method for diagnosing the presence of colon cancer 
in a patient comprising: 

(a) measuring levels of CSG in a sample of cells, tissue 
or bodily fluid obtained from the patient; and 

(b) comparing the measured levels of CSG with levels of 
CSG in a sample of cells, tissue or bodily fluid obtained from 
a control, wherein an increase in measured levels of CSG in 
the patient versus the CSG levels in the control is associated 
with the presence of colon cancer. 

2. A method of diagnosing metastatic colon cancer in 

a patient comprising: 

(a) measuring levels of CSG in a sample of cells, 
tissue, or bodily fluid obtained from the patient; and 

(b) comparing the measured levels of CSG with levels of 
CSG in a sample of cells, tissue, or bodily fluid obtained 
from a control, wherein an increase in measured CSG levels in 
the patient versus the CSG levels in the control is associated 
with a cancer which has metastasized. 

3. A method of staging colon cancer in a patient 
comprising-: 

(a) identifying a patient suffering from colon cancer; 

(b) measuring levels of CSG in a sample of cells, 
tissue, or bodily fluid obtained from the patient; and 

(c) comparing the measured levels of CSG with levels of 
CSG in a sample of cells, tissue, or bodily fluid obtained 
from a control, wherein an increase in the measured levels of 
CSG versus the levels of CSG in the control is associated with 
a cancer which is progressing and a decrease in the measured 
levels of CSG versus the levels of CSG in the control is 
associated with a cancer which is regressing or in remission. 
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4 . A method of monitoring colon cancer in a patient 
for the onset of metastasis comprising: 

(a) identifying a patient having colon cancer that is 
not known to have metastasized; 

(b ) periodically measuring CSG levels in samples of 
cells, tissue, or bodily fluid obtained from the patient; and 

( c) comparing the periodically measured levels of CSG 
with levels of CSG in cells, tissue, or bodily fluid obtained 
from a control, wherein an increase in any one of the 
periodically measured levels of CSG in the patient versus the 
levels of CSG in the control is associated with a cancer which 
has metastasized. 

5. A method of monitoring changes in a stage of colon 
cancer in a patient comprising: 

(a) identifying a patient having colon cancer; 

(b ) periodically measuring levels of CSG in samples of 
cells, tissue, or bodily fluid obtained from the patient; and 

... „ a ,. ure d levels of CSG with levels of 

(c) comparing the measurea levels 

no Hc;cnP or bodily fluid of 
CSG in a sample of the same cells, tissue, or do y 

a control, wherein an increase in any one of the P"**«»"* 

measured levels of CSG versus levels of CSG in the control 

is associated with a cancer which is progressing in stage and 

a decrease in any one of the periodically measured levels of 

CSG versus the levels of CSG in the control is assorted with 

a cancer which is regressing in stage or in remission. 

- i ■ i o A nr 5 wherein the CSG 
6. The method of claim 1, 2, 3, 4 or o 

comprises SEQ ID N0:3, 4, 5 or 7. 
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<140> 
<141> 
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<160> 10 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 487 

<212> DNA 

<213> Homo sapiens 

<400> 1 

tctgcatctg gccctcccag tgcacctgtt caatcccagc ycctccctga cctgtacaaa 60 

tacacctgag gaccggctcg agcccagact tcctgcccct gctctgcact ctcaggtatt 120 

ccctgctctt actccaaaea gatggaccca ggtccgaagg ggcactgcca ctgtgggggg 180 

catggccatc ctccaggtca ctgcgggcga acccctggcc atggcccagg gccctgcggg 24 0 

ccaccccctg gccatggccc agggccctgc gggcaacccc ctggccatgg cccagggccc 300 

tgcgggcctc cccctggcca tggcccaggt cacccacccc ctggtccaca tcactgagga 360 

agtagaagaa aacaggacac aagatggcaa gcctgagaga attgcccagc tgacctggaa 420 

tgaggcctaa accacaatct tctcttccta ataaacagcc tcctagaggc cacattctat 480 
tctttaa 487 

<210> 2 

<211> 739 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (693) 

<220> 

<221> unsure 
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<222> (696) . . (699) 
<220> 

<221> unsure 
<222> (703) . . C?05) 

<220> 

<221> unsure 
<222> (706) 

<220> 

<221> unsure 
<222> (710; . . (716) 

<220> 

<221> unsure 
<222> (716) 

<220> 

<221> unsure 
<222> (723) . . (726) 

<220> 

<221> unsure 
<222> (728) 

<220> 

<221> unsure 
<222> (732) 

<220> 

<221> unsure 
<222> (737; 



tctgaaacto tcagttccac cagcacgct tggatactgg taagtttcca gggW^c. 
ttgcatctga aactgtcagc cccagaatgt tgacagtcgc tctcctagcc ^ctctgtg 120 
ccLagcctc tggcaatgcc attcaggcca ggtcttcctc ctatagtgga «>£.tgg.. 
gtggtgatgg ...gcgatte tctcattctg gcaaccagtt ggacggcccc atcaccgccc 24C 
tccgg^ccg actcaacaca tactacatcg taggtcttca ggtgcgctat ggcaaggtg. 300 
gga 9 gcgacta tgxgggtggt cgcaacggag acctggagga gatctttctg caccc ggg 
aatcagtoat ccaggtttc, gggaagtaca agtggtacct gaagaagctg 9«tttgrg. 42C 
cagacLggg ccgctatctg tcttttggga aagacagtgg cacaagtttc aatgccgtcc 
cc gcacc! caacaccgtg ctccgcttca tcagtggccg gtctggttct ctcatcga g 
ccattggcct gcactgggat gtttacccca ctagctgcag cagatactga ^ctcctctc 600 
cttgoclgog acctgtg.t gaggagtaag aactccctta tcactaaccc ccatccaaat 660 
ggctcaataa aaaaatatgg ttaaggctaa aanaaaanng gannnaanan nnnnnntnc. 720 
aannnnantt cncctgnic 
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<210> 3 

<211> 428 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 
<222> (391) 



<400> 3 

aattgtccgg ggtcaaacag aggagagcat 
gtcttttgag atgtggca-g atcgtgactc 
tccccctgag tcttggaagr ggatccttgc 
cctccggttt taccgctccc agcagaaggt 
caaagttttg gaattgcaga tgaacaagcg 
ctttgttaau tgcccczcaa tctctctcct 
ctccagagga agatttct.c ttcattcata 
tataaggg 



gaatgagagt catcctcgca agtgtgcaga 60 

ccactgtagg cgccctaagt ttgaagggca 12C 

accggtcatt ctttatatct gtgaaaggat 180 

tgtgattacc aaggttgtta tgcacccatc 24C 

tggcttcagc atggaagtgg ggcagtatat 300 

gggaatggca rccttttact t ugacctctc 36C 

tncqagcagc aggggacttg acagaaaatc 42C 

42 = 



<210> 4 

<211> 1347 

<212> DNA 

<213> Homo sapiens 

<400> 4 

ggaaaacccc tgagcacaaa gcaagaggca 

aacaggggtg tcgtgcggzg ggagtacttc 

gacgagcccc agcaggccca agtcccccat 

gcactgaggc tgcagaag— ccagtcatct 

ctgcgccggg agcaagagg- ggcagaggag 

tccccaacgc cagatgacsa ctctgaccag 

atcacgggca gttactcggi gtctgagtct 

aacgtggcgt ggacagtgga agatccagtg 

gagcaatggt acgctggcat caacccctcg 

atacgggtga cccgtcacaa gaacgccatg 

agtgaggagg atgactgagc ctcgggatgg 

cgtgggaact gccaagacca tcgccaagcc 

gatccaagaa ccacaccrca tctgccaaca 

ggtttttcgt ttccct—ct atcttccttt 

ttttggggat gaaatgggga cccctgctga 

ccctgggtct aagaaagaaa gagacccgct 

tagtctgagg gcagagccac cggtcaaaga 

ctgcgggtct gagggaggag acactcagct 
ggagacctgc ccctttccaa ccctgggaaa 
ccagtgctgg gggtcccrgt gcagccctca 

ctggtggaca caaagctc" acatcgatag 
gagaaaagtq acccagtzta ggagctggag 



tcgaagcccc ctcggggatg cccgcaagcc 60 
cgcctgcgtc ctctgcggtt cagggcccca 120 
gtctggggct gggaggtggc tggggcccct 18C 
gatctgctgg aaagggagag ggagagtgtc 240 
cggagaaatg ctctcttccc agaggtcttc 300 
aactccagga gctcctccca ggcatccggc 361 
cccttcttca gccccatcca cctacactca 42C 
gacagtgctc ctcccgggca gagaaagaag 4 8C 
gacggtatca actcagaggt cctggaagcc 54 0 
gcagagcgct gggaatcccg catctacgcc 60C 
ggcgcccacc ccctgccctg ccctgaccct 660 
cccaccctag gaaatgggtc ctaggtccag 720 
atcccaccat gggcacattt gggactgttg 780 
agaaatgttt ctgcctttgg ggtctaaagc 84 Z 
ttctttctgc ttctaagact ttgccaaatg 9CC 
cctccacttt caggtgtaat ttgcttcccc 96C 
gggtggcaca gatcgcagca ccttgagggg 1C20 
cctccctctg agaagtccca agctgagagg 1080 
ccatccagtc tgagggagga ggccaaactc 114 0 
aacccttcac cttggtgcac ccagccacac 1200 
gatcccatga ggatggtccc cttcacctgg 1260 
gggggtcttt gtcccccacc cccaaactgc 1320 
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cctgaaataa acctggagtg agctgcc 

<210> 5 

<211> 1249 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (1034) . . {1046! 

ggcagagcct gcgcagggca ggagcagctg gcccactggc ggcccgcaac actccgtctc SO 
accctctggg cccactocat ctagaggagg gccgtctgtg aggccactac ccctccagca 12, 
actgggaggt ggg.ctgtc. gaagctggcc cagggtggtg gtcagctggg tcagggacct 180 
aacggcacct ggctgggacc acctcgcctt ctccatcgaa gcaggggaag tgggagcc.c 240 
gagccccgg gtggaagctg accccaagcc .cccttc.cc tggacagga ^gagtgtc 00 
ggtgtgettc ac=-.cctgg= cctctcttt gccatagtca cgacatggat g^t.ttcg- ,6 
agctacatga gct.cagcat gaaaaccatc cgtctgccac gctggcggc ctcgcccacc 20 
aaggagatcc aggttaaaaa gtacaagtgt ggcctcatca agccctgccc agccaactac 4 0 
tttg4ttta aaatctgcag tggggccgcc aacgtcgtgg gccctactat gtgctttgaa 540 
gaccgcatga tca.gagtcc tgtgaaaaac aatgtgggca gaggcctaaa "tcgccctg 00 
gtgaatggaa ccacgggagc tgtgctggga cagaagtcat ttgacatgta ctctggagat 660 
Jttatgcacc tagtgaaatt ccttaaagaa attccggggg gtgcactggt 9ctggtggcc 720 
tcctacgacg atccagggac caaaatgaac gatgaaagca ggaaactctt ctctgacttg 780 
gggagttcct acgcaaaaca actgggcttc cgggacagct gggtcttcat .gg.gcc... 40 
gacctcaggg gtaaaaoccc ctttgagcag ttcttaaaga acagcccaga cacaaacaaa 900 
tacgago^t ooccagagct gctggagatg gagggctgca tgcccccgaa gccattttag 960 
ggtggc^gtg gctcttcctc agccaggggc ctgaagaagc tcctgcctga cttaggagtc 020 
Igagc Jgc I ggnnnnnn n nnnnnnnnnn nnnnnntgct gcgtggaagg tgctgcagg 1080 
ccttgcacgc tgtgtcgcgc ctctcctcct cggaaacaga accctcccac agcacatcct 14, 
acccgoaaoa ccagcctcag agggtccttc tggaaccagc tgtctgtgga gagaatgggg 120. 
tgctttcgtc agggactcct gacggctggt cctgaggaag gacaaactg 

<210> 6 

<211> 1220 

<212> DNA 

<213> Homo sapiens 

gcttt«gca cctcattcca catcaggagc gtttttggag aaagctgcac tctgttg.gc 60 
tccaggocac aotggaggga gggagtgaag gagctctctg tacccaagga aagtgcagct £ 
gagactcaga caagattaca atgaaccaac tcagcttcct gctgtttctc V-gcgacca 
ccLggatg gactacagat gaggctaata cttacttcaa ggaatggacc tgttcttcgt 240 
ctccafctct occcagaagc tgcaaggaaa tcaaagacga atgtcctagt gcatttgatg 300 
gcctgtattt ; c «cgc. C t gagaatggtg ttatctacca gaccttctgt gacatgacct 60 
ctggggatgg cggctgg.cc ctggtggcc. gcgtgcacga gaatgacatg cgtgggaagt 4.0 
gcacggtggo coatcoctcg tccagtcagc agggcagcaa agcagtctac ccagaggggg 4 8C 
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acggcaactg ggccaactac aacacctttg 
acaagaaccc tggctactac gacatccagg 
ataagtcccc catgcagcac tggagaaaca 
gcttcctcca gacaccggga cataatctgt 
atggagaagg aaactgttgg actgacaacg 
gcgacgccca gaaaacagca tcttattact 
gatttgttca gttcagggta tttaataacg 
tgagggtcac cggatgtaac actgagcacc 
aggccagtcc ccagcagrgt ggagattttt 
atgttggtta cagcagcaqc cgtgagataa 
gagagttttc tgggagggaa cccagacctc 
gaacaactta cccagtacct agaatgttaa 
act caaaaaa aaaaaaaaag 



gatctgcaga 


ggcggccacg 


agcgatgacr 


540 


ccaaggacct 


gggcatctgg 


cacgtgccca 


600 


gctccctgct 


gaggtaccgc 


acggacactg 


660 


ttggcatcta 


ccagaaataz 


ccagtgaaat 


720 


gcccggtgat 


ccctgtggtc 


tatgattttg 


780 


caccctatgg 


ccagcgggaa 


trcactgcgg 


840 


agagagcagc 


caacgccttg 


tgtgctggaa 


900 


actgcattgg 


tggaggagga 


tactttccag 


960 


ctggttttga 


ttggagtgga 


r atggaactc 


1020 


ctgaggcagc 


tgtgcttcta 


t rctatcgtt 


1060 


tcctcccaac 


catgagatcc 


caaggatgga 


1140 


tggcagaaga 


gaaaacaata 


aatcatattg 


1200 
1220 



<210> 7 

<211> 2796 

<212> DNA 

<213> Homo sapiens 



<400> 7 

cggctcgagg gacacgatga ggcccggcct ctcatttctc ctagcccttc tgttcttcct 60 
tggccaagct gcaggggatt tgggggatgt gggacctcca attcccagcc ccggcttcag 120 
ccctttccca ggtgtrgact ccagctccag cttcagctcc agctccaggt cgggctccag 180 
ctccagccgc agcttaggca gcggaggttc tgtgtcccag ttgttttcca atttcaccgg 24 0 
ctccgtggat gaccgtggga cctgccagtg ctctgtttcc ctgccagaca ccacctttcc 300 
cgtggacaga gtggaacgct tggaattcac agctcatgtt ctttctcaga agtttgagaa 360 
agaactttcc aaagtgaggg aatatgtcca attaattagt gtgtatgaaa agaaactgtt 420 
aaacctaact gtccgaantg acatcatgga gaaggatacc atttcttaca ctgaactgga 480 
cttcgagctg atcaaggtag aagtgaagga gatggaaaaa ctggtcatac agctgaagga 54C 
gagttttggt ggaagcccag aaattgttga ccacctggag gtggagataa gaaatatgac 600 
tctcttggta gagaagcttg agacactaga caaaaacaat gtccttgcca ttcgccgaga 660 
aatcgtggcv ctgaacacca agctgaaaga gtgtgaggcc tctaaagatc aaaacacccc 72C 
tgtcgtccac cctcctccca ctccagggag ctgtggtcat ggtggtgtgc tgaacatcac 78C 
caaaccgtct gtgg^cagc tcaactggag agggttttct tatctatatc gtgcttgggg 840 
tagggattac tczccccacc atccaaacaa aggactgtat tgggtggcgc cattgaatac 900 
agatgggaga ctgttggag- attatagact gtacaacaca ctggatgatt tgctattgta 960 
tataaacgct cgagagttgc ggatcaccta tggccaaggt agtggtacag cagtttacaa 1020 
caacaacatg tacgtcaaca tgtacaacac cgggaatatt gccagagtta acctgaccac 1080 
caacacgati gctctgaczc aaactctccc taatgctgcc tataataacc gcttttcata U4C 
tgctaatgtt gcttggcaag atattgactt tgctgtggat gagaatggat tgtgggttaz 120C 
ttattcaact gaagccagca ctggtaacat ggtgattagt aaactcaatg acaccacaci 126G 
tcaggtacta aacacttgct ataccaagca gtataaacca tctgcttcta acgccttca- 132C 
ggtatgtggg gttc^guatg ccacccgtac tatgaacacc agaacagaag agatttttTia 1360 
ctattatgac acaaacacag ggaaagaggg caaactagac attgtaatgc ataagatgca 1440 
ggaaaaagtg cagaccatta actataaccc ttttgaccag aaactttatg tctataacga 150C 
tggttacctt ctgaattatg atcttrctgt cttgcagaag ccccagtaag ctgtttagga 1560 
gttaggcrga aagagaaaat gtttgrtgaa aaaatagtci: tctccactta cttagatatc 162C 
tgcagggg^c tctaaaagtg tgttcatttt gcagcaatgt ttaggtgcai agcrctacca 168C 
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cactagagat ctaggacatt tgtcttgatt tggtgagttc tcttgggaat catctgcctc 1740 
ttcaggcgca ttttgcaata aagtctgtct agggtgggat tgtcagaggt ctaggggcac 00 
tgtgggccta gtgaagccta ctgtgaggag gctrcactag aagccttaaa ttaggaatta 8 0 
aggaacttaa aactcagtat ggcc.ctagg gattctttgt acaggaaata «,ccc«t, 
actagtcctc atccatgtag caccactaat tcttccatgc ctggaagaaa cctggggact 1980 
tagttaggta gattaatatc tggagctcct cgagggacca aatctccaac ttttttttcc 2040 
cctcactlca cctggaa.ga tgctttgt.t gtggcagata agtaaatttg gcatgctta 2 00 
atattctaca tctgtaaagt gctgagtttt atggagagag gccttttta gca ta att 2160 
gtacatggca aataaatccc agaaggatct gtagatgagg cacctgcttt "cttttctc 2220 
Lttgtcca ccttactaaa agtcagtaga atcttctacc tcataacttc cttcca 2280 
cagctcagaa gattagaacc agacttacta accaattcca ccccccacca acccccttc. 40 
"tgcctact ttaaaaaaat taa.agtttt ctatggaact gatctaagat tagaaaaatt 00 
a^ttcttt aatttcatta tggactttta tttacatgac tctaagacta taagaaaatc 
tgatggcaor gacaaagtgc tagcatttat tgttatctaa taaagacctt ggagcatatg .520 
^gcaacttat gagtgtatca gttgttgct gt-tttttg cctttgttta agcctggaac 25 C 
ttgtaagaaa Itgaaaattt ..tttttttt tctaggacga gcta.agaaa agctat gag 64. 
ag atctact t.«c.gtgc agtaottgga aaccttgcg ctgtatgrga tgtgcttctg 2 ; 
tgcttttg.. tg.ctt«-.c atcagtctt tgrctgtttt tcctttga.g ««gtc. 
agtctatagg attggcagtt taaatgcttt actccc 

<210> 8 

<211> 2331 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (675) 

tJtatcacgg gctcaactgc aacaaaacac ttccttgaca gctccacaaa ct«ggcc.c 60 
agtgaggaa, caacaa.att ccacagcagc ccagatgcaa gtggaacaac accctcatc. 12 
gcccacrcca caacccagg tcg.ggagaa tctacaacct cacgcatcag «« ggcte^ ; 
Ltgaaataa caacgttacc tggcagtacc acaacaccag gcctcagtga ^ atc "« ?4- 
aJtctaca ataaccccag atcaccagac caaacactct cacctgccag catgagaagc 30C 
tec gca ca ^tggagaacc caccagcttg ta.agccaag cagagtcaac -acacaaca 3 ; 
gcgtLctg ccagcaccac cacc.caggc ctcagtcagg aatcaacaac tttcc.cagt 42. 
aagecagget caactgaaac aacactgrcc cctggcagca tcacaacttc atettttge. 480 
c a^aftta cc.ccc«c tagccaacca ggctcagctc tgtcaacagt ^cacctgcc 5 , 
agcaccacao tgccaggcct tagtgaggaa tctaccacct tctacagcag cccggctc. 60. 
actgaaacca c.gcgtt«c tcacagcaac acaatgtcca ttcatagtca acaatctac 66. 
cccttcccto acagnccagg cttcactcac acagtgttac ctgccaccct «c..cc.c. 72. 
gacattggic aggaarcaac agccttccac agcagctcag aegcaactgg. aacaacaccc 7 
gaca"gg^ 9 * ntpaaacct t qttggagaac ctacaacttt ctacatcagc 84C 

ttacctgccc gctccacagc ctcagacctt gtiggay" „ rr . ar -*nan 900 

ccatccccta cttacacaac actctttcct gcgagttcca gcacatcagg cctcactgag 900 
glatctacca ccttccacac cagtccaagc ttcacttcta caattgtgtc "«gaaagc 
ctggaaacct tagcaccagg gttctgccag gaaggacaaa ttrcgaatgg a-acaatgc C O 
gtctgtcccc aaggctacgt tggtc.ccg tgcttgtccc ctctggaatc c.tccctgi. 1060 
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gaaaccccgg aaaaactcaa cgccacttta ggtatgacag tgaaagtgac ttacagaaat 1140 
ttcacagaaa agatgaatga cgcatcctcc caggaatacc agaacttcag taccctcttc 1200 
aagaatcgga tggatgtcgt tttgaagggc gacaatcttc ctcagtatag aggggtgaac 1260 
attcggagat tgctcaacgg tagcatcgtg gtcaagaacg atgtcatcct ggaggcagac 1320 
tacactttag agtatgagga actgtttgaa aacctggcag agattgtaaa ggccaagatt 1380 
atgaatgaaa ctagaacaac tcttcttgat cctgattcct gcagaaaggc catactgtgc 1440 
tatagtgaag aggacacttt cgtggattca tcggtgactc cgggctttga cttccaggag 1500 
caatgcaccc agaaggctgc cgaaggatat acccagttct actatgtgga cgtcttggat 1560 
gggaagctgg cctgtgtgaa caagtgcacc aaaggaacga agtcgcaaat gaactgtaac 1620 
ctgggcacat gtcagctgca acgcagtgga cccccgctgc ctgtgcccaa aracgaacac 1680 
acactggtac tggggagaga cctgtgaatt caacatcgcc aagagcctcg tgtatgggat 1740 
cgtgggggct gtgatggcgg tgctgctgct cgcattgatc atcctaatca tcttattcag 1800 
cctatcccag agaaaacggc acagggaaca gtatgatgtg cctcaagagt ggcgaaagga 1860 
aggcacccct ggcatcttcc agaagacggc catctgggaa gaccagaatc tgagggagag 1920 
cagattcggc cttgagaacg cctacaacaa cttccggccc accctggaga crgttgactc 1980 
tggcacagag ctccacatcc agaggccgga gatggtagca tccacrgtgt gagccaacgg 204C 
gggcctccca ccctcatcza gctctgttca ggagagctgc aaacacagag cccaccacaa 210C 
gcctccgggg cgggrcaaga ggagaccgaa gtcaggccct gaagccggtc crgctctgag 2160 
ctgacagact tggccagtcc cctgcctgtg ctcctgctgg ggaaggctgg gggctgtaag 2220 
cctctccatc cgggagcttc cagactccca gaagcctcgg cacccctgtc tcctcccggg 2280 
tggctcccca ctctggaatt tccctaccaa taaaagcaaa tctgaaagct c 2331 

<210> 9 

<211> 909 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gaggaggtgg gcgccaacag acaggcgatt aatgcggctc ttacccaggc aaccaggact 60 

acagtataca ttgtggacat tcaggacata gattctgcag ctcgggcccg acctcactcc 120 

tacctcgatg cctactttgt cttccccaat gggtcagccc tgacccttga tgagctgagt 180 

gtgatgatcc ggaatgatca ggactcgctg acgcagctgc tgcagctggg gczggtggtg 24 0 

ctgggctccc aggagagcca ggagtcagac ctgtcgaaac agctcatcag tgtcatcata 300 

ggattgggag tggctttgcr gctggtcctt gtgatcatga ccatggcctt cgtgtgtgtg 360 

cggaagagct acaaccggaa gcttcaagct atgaaggctg ccaaggaggc caggaagaca 420 

gcagcagggg tgatgccc-c agcccctgcc atcccaggga ctaacatgta caacactgag 4 80 

cgagccaacc ccatgctgaa cctccccaac aaagacctgg gcttggagta cctctctccc 540 

tccaatgacc tggactctgt cagcgtcaac tccctggacg acaactctgt ggatgtggac 600 

aagaacagtc aggaaatcaa ggagcacagg ccaccacaca caccaccaga gccagatcca 660 

gagcccctga gcgtggtcct gttaggacgg caggcaggcg caagtggaca gctggagggg 720 

ccatcctaca ccaacgctgg cctggacacc acggacctgt gacaggggcc cccactcttc 780 

tggacccctt gaagaggccc taccacaccc taactgcacc tgtctccctg gagatgaaaa 840 

tatatgacgc tgccctgcct cctgcttttg gccaatcacg gcagacaggg gttggggaaa 900 
tattttatt 905 

<210> 10 
<211> 510 
<212> PRT 
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<213> Homo sapiens 

Met°Arg°Pro Gly Leu Ser Phe Leu Leu Ala Leu Leu Phe Phe Leu Gly 
1 



5 10 15 



Gin Ala Ala Gly Asp Leu Gly Asp Val Gly Pro Pro lie Pro Ser Pro 



20 



25 



30 



Gly Phe Ser Pro Phe Pro Gly Val Asp Ser Ser Ser Ser Phe Ser Ser 
35 40 45 

Ser Ser Arg Ser Gly Ser Ser Ser Ser Arg Ser Leu Gly Ser Gly Gly 



50 



55 60 



Ser Val Ser Gin Leu Phe Ser Asn Phe Thr Gly Ser Val Asp Asp Arg 
65 10 

Gly Thr Cys Gin Cys Ser Val Ser Leu Pro Asp Thr Thr Phe Pro Val 

90 95 



85 



Asp Arg Val Glu Ara Leu Glu Phe Thr Ala His Val Leu Ser Gin Lys 
100 105 HO 

Phe Glu Lys Glu Leu Ser Lys Val Arg Glu Tyr Val Gin Leu lie Ser 



115 



120 



125 



Val Tyr Glu Lys Lys Leu Leu Asn Leu Thr Val Arg He Asp lie Met 



130 



135 



140 



Glu Lys Asp Thr lie Ser Tyr Thr Glu Leu Asp Phe Glu Leu lie Lys 
145 150 155 160 

Val Glu Val Lys Glu Met Glu Lys Leu Val lie Gin Leu'Lys Glu Ser 
165 I 70 175 

Phe Gly Gly Ser Ser Glu lie Val Asp Gin Leu Glu Val Glu lie Arg 
180 1" 190 

Asn Met Thr Leu Leu Val Glu Lys Leu Glu Thr Leu Asp Lys Asn Asn 
195 200 205 

Val Leu Ala lie Arg Arg Glu He Val Ala Leu Lys Thr Lys. Leu Lys 



210 



215 220 



Glu Cys Glu Ala Ser Lys Asp Gin Asn Thr Pro Val Val His Pro Pro 



225 



230 



235 
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Pro Thr Pro Gly Ser Cys Gly His Gly Gly Val Val Asn lie Ser Lys 
245 250 255 

Pro Ser Val Val Gin Leu Asn Trp Arg Gly Phe Ser Tyr Leu Tyr Gly 
260 265 270 

Ala Trp Gly Arg Asp Tyr Ser Pro Gin His Pro Asn Lys Gly Leu Tyr 
275 280 285 

Trp Val Ala Pro Leu Asn Thr Asp Gly Arg Leu Leu Glu Tyr Tyr Arg 
290 295 300 

Leu Tyr Asn Thr Leu Asp Asp Leu Leu Leu Tyr He Asn Ala Arg Glu 
305 310 315 320 

Leu Ara He Thr Tyr Gly Gin Gly Ser Gly Thr Ala Val Tyr Asn Asn 
325 330 335 

Asn Met Tyr Val Asn Met Tyr Asn Thr Gly Asn He Ala Arg Val Asn 
340 345 350 

Leu Thr Thr Asn Thr He Ala Val Thr Gin Thr Leu Pro Asn Ala Ala 
355 360 365 

Tyr Asn Asn Arg Phe Ser Tyr Ala Asn Val Ala Trp Gin Asp He Asp 
370 375 380 

Phe Ala Val Asp Glu Asn Gly Leu Trp Val He Tyr Ser Thr Glu Ala 
385 390 395 400 

Ser Thr Gly Asn Met Val He Ser Lys Leu Asn Asp Thr Thr Leu Gin 
405 410 415 

Val Leu Asn Thr Trp Tyr Thr Lys Gin Tyr Lys Pro Ser Ala Ser Asn 
420 425 430 



Ala Phe Met Val Cys Gly Val Leu Tyr Ala Thr Arg Thr Met Asn Thr 
435 440 445 

Arg Thr Glu Glu He Phe Tyr Tyr Tyr Asp Thr Asn Thr Gly Lys Glu 
450 455 460 

Gly Lys Leu Asp He Val Met His Lys Met Gin Glu Lys Val Gin Ser 
465 470 475 480 

He Asn Tyr Asn Pro Phe Asp Gin Lys Leu Tyr Val Tyr Asn Asp Gly 
485 490 495 
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Tyr Leu Leu Asn Tyr Asp Leu Ser Val Leu Gin Lys Pro Gin 
500 505 510 
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